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ABSTRACT 

A fuzzy controller is usually designed by formulating the 
knowledge of a human expert into a set of linguistic variables 
and fuzzy rules. One of the most successful methods to au- 
tomate the fuzzy controllers development process are evolu- 
tionary algorithms. In this work, we propose the Recurrent 
Fuzzy Voronoi (RFV) model, a representation for recurrent 
fuzzy systems. It is an extension of the FV model 1131 that 
extends the application domain to include temporal prob- 
lems. The FV model is a representation for fuzzy controllers 
based on Voronoi diagrams that can represent fuzzy systems 
with synergistic rules, fulfilling the e-completeness property 
and providing a simple way to introduce a priory knowledge. 
In the proposed representation, the temporal relations are 
embedded by including internal units that provide feedback 
by connecting outputs to inputs. These internal units act 
as memory elements. In the RFV model, the semantic of 
the internal units can be specified together with the apri- 
ori rules. The geometric interpretation of the rules allows 
the use of geometric genetic operators during the evolution. 
The representation and the algorithms have been validated 
in two problems in the area of system identification and evo- 
lutionary robotics. 

Categories and Subject Descriptors 

[Genetic Algorithms] 

General Terms 

Recurrent Fuzzy Control 

Keywords 

Genetic algorithms, Recurrent fuzzy systems. Fuzzy control, 
Voronoi diagrams. Evolutionary Robotics 

1. INTRODUCTION 

The development of controllers by using fuzzy logic tech- 
niques has been subject of theoretical research with many in- 



teresting successful applications produced during last years 
The main reason is that fuzzy logic controllers (FLC) pro- 
vide satisfactory performance in face of uncertainty and im- 
precision [3, while keeping an equivalence in knowledge rep- 
resentation with other methods like neural networks and au- 
tomata [^. An FLC represents a non linear model as the 
combination of a set of local linear models, where each one 
represents the dynamics of a complex system in a single 
local region Each local model is specified by a fuzzy 
rule, which defines the local region in which the rule applies 
through the membership functions used in the antecedent, 
while the consequent defines the output of the model. Most 
FLCs can be classified in two categories: the Mamdani type 
FLC where the output is computed as a combination of fuzzy 
numbers, and the Takagi-Sugeno (TS) type, where the out- 
put is defined as a linear combination of the inputs. In a TS 
type FLC with n inputs and m outputs, a typical rule has 
the following form: 
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where Xj{l < j < n) are the input variables, yj{l < j < 
m) are the output variables, A^il < j < n) are the fuzzy 
membership sets and a'ij{0 < j < n,l < k < m) are the 
real valued parameters that define the linear approximation. 
The output of the complete FLC is computed by combining 
the outputs produced by all the rules, weighted by the degree 
of satisfaction of the antecedents. 

Simple FLCs are usually defined by a trial and error process 
by using expert knowledge. However, automatic FLC gen- 
eration methods are preferred for complex control systems. 
Most FLC structures can be mapped into feed-forward neu- 
ral networks, allowing the use of neural network learning 
algorithms to automate the design of FLC based on numer- 
ical data as well as on expert knowledge. The combined 
approach provides advantages from both worlds: the low 
level learning and computational power of neural networks 
is joined together with the high level human-like thinking 
and reasoning of fuzzy systems This combination has 

been very successful and there is a large number of models 
that combines fuzzy systems with neural networks or 
even with standard PID control (211 



The domain of application of these systems is limited to 



static problems due to its feed forward network structure I14| . 
Most non linear problems in control require the processing 
of temporal sequences, or in other words, in these problems 
the output depends on the current input and previous values 
of inputs and/or outputs. A very interesting approach that 
considers small order temporal problems with fuzzy logic 
is proposed in |^. However, unless the number of delayed 
inputs and outputs is known before, it is not possible to 
define a feed forward model that can process temporal se- 
quences This is usually the case for most control prob- 
lems, where this information is not known. However, recur- 
rent structures can deal with this kind of problems. There 
is a large number of neural network models that have been 
proposed which are essentially feed forward structures with 
an extra set of units used to store previous activation values 
that are connected back to the inputs of other units. 

By considering the amount of recurrent neural network mod- 
els that have been proposed, it is not unexpected to see that 
most recurrent fuzzy systems are based on neural networks. 
For example, the model RFNN (Recurrent Fuzzy Neural 
Network) proposed in 1141 defines recurrent connections in 
the second layer of the structure, which corresponds to the 
units that codifies the membership antecedent values. The 
model RSONFIN (Recurrent Self Organizing Neural Fuzzy 
Inference Network) proposed in I10| perform structure and 
parameter learning and includes an extra layer of units with 
recurrent connections that provides a kind of internal mem- 
ory. The model DFNN (Dynamic Fuzzy Neural Network) 
proposed in I16| includes recurrent neural networks in the 
consequent in place of standard linear approximators like in 
the TS model. The TRFN (Takagi-Sugeno Type Recurrent 
Fuzzy Network) model proposed in [3| has an extra unit 
with recurrent connections for each fuzzy rule, which is re- 
sponsible of memorizing the temporal history of activations 
of the rule. Other models like 1111 and 1221 follow similar 
approaches. In most cases, both supervised learning and 
non gradient based algorithms, like genetic algorithms or 
reinforcement learning, have been used to build or enhance 
the models. 

Even if these recurrent models are successful in supporting 
learning of temporal sequences, in most cases the logic in- 
terpretation of recurrent units is not considered. In this 
work, we propose a recurrent structure for fuzzy systems 
based in the Fuzzy Voronoi (FV) method proposed in 1131 . 
which allows the definition of recurrent fuzzy systems with 
a clear interpretation of recurrent units. The proposed Re- 
current Fuzzy Voronoi (RFV) structure consists in a set of 
rules, where the antecedent of the rules are determined by 
multidimensional membership functions defined in terms of 
Voronoi regions. The RFV model includes external and in- 
ternal variables with recurrent connections that allow the 
processing of temporal sequences of arbitrary length. Ge- 
netic algorithms are proposed as a design tool, since they 
do not require derivative information, which in most control 
problems is unavailable or costly to obtain. 

This paper is organized as follows. Section |5| describes the 
structure of the RFV model and the details on the geomet- 
rical basic structure. In section|3] the design of RFV models 
with genetic algorithms is analyzed. The properties of the 
proposed representation are discussed in section 0] In sec- 



tion |3 two experiments with the RFV model are detailed. 
Finally, conclusions are presented in section |S| 

2. THE RFV MODEL 

In this section, the structure of the RFV model is presented, 
the fuzzy reasoning strategy is explained and the details 
on the computation of the membership functions are intro- 
duced. 

2.1 Structure 

A schematic diagram of the model is shown in figure^ which 
is organized in four layers and consists of I input variables, r 
internal variables, m output variables and u) rules. Units in 
layer 1 are called input units. There are two types of input 
units: external inputs and internal units that are used also 
as standard inputs in rule definition. Units in layer 2 are 
called partition units. They act as multidimensional fuzzy 
membership functions. Units in layer 3 are called rule units. 
Each fuzzy rule in the fuzzy system has a corresponding 
rule unit. There is a one to one correspondence with units 
in layer 2. Units in layer 4 are called output units. They 
compute the outputs as a weighted linear combination of 
input units, generating both the external outputs and the 
values of the internal units to be made available as inputs 
in the next time step. 

The function of each type of unit is described below. In the 
descriptions, the external input vector of size I is denoted by 
X, the internal vector of size r is denoted by h, the output 
vector of size m is denoted by y, the complete input vector 
for the rules of size I + r is denoted hy I — x : h and the 
complete output vector produced by the rules of size m + r 
is denoted hy O = v : y, where : identifies the concatenation 
operator. 

Layer 1 : No computation is performed in this layer. Ex- 
ternal input values x and previous values of internal 
units y are transmitted to the units in layer 2. 

Layer 2 : The fc-th unit in this layer computes the fuzzy 
membership value ^ts^ (/) of the input vector / to the 
multidimensional fuzzy set Sk associated to the k-ih 
rule. More details on this computations are provided 
in section 031 

Layer 3 : Units in this layer compute a linear combination 
of input values based on the parameters specified by 
each rule, weighted by the corresponding degree of ac- 
tivation, as usual in TS fuzzy systems. Note that this 
units produce m + r outputs. The output produced by 
the unit k that corresponds to the output variable i is: 

O'i = {al^ + J2al,,I,)MI) (2) 

where the a\j are the real valued parameters that com- 
pute the linear combination of input values associated 
to the rule k for the output variable i. 

Layer 4 : Units in this layer compute the output vector 
O by computing the summation of the corresponding 
outputs produced by each rule. That is: 

0« = E of (3) 

k 
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Figure 1: The structure of the RFV model 



2.2 Fuzzy reasoning model 

The RFV model performs fuzzy inference by using rules de- 
fined as follows: 



Rk 



if 



/ is Ak 



then Oi = alo + J^i aljlj 



(4) 
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It can be noted that there is a strong similarity with the 
standard definition of TS rules given in equation Except 
for the fact that now a single multidimensional set is used for 
membership, the main difference is that the fuzzy inference 
involves r terms in the input vector that are output values 
produced in the previous time step. The fuzzy system is a 
dynamic fuzzy inference system with the inferred values Vi 
produced in time t + 1 given by: 



(5) 



where the computation of the value O'iit) (see equation |5J 
involves the input vector x{t) at time t and the internal 
values y(t — 1) defined in time t — 1. 

2.3 Membership computation 

The domain partition strategy is based on Voronoi diagrams. 
A Voronoi diagram induces a subdivision of the space based 
on a set of points called sites. Formally [3, a Voronoi dia- 
gram of a set of p points V = {Pi , . . . , Pp} is the subdivision 
of the plane into p cells, one for each site in V, with the prop- 
erty that a point M lies in the cell corresponding to a site 
Pi if and only if the distance between M and Pi is smaller 
than the distance between P and all other Pj (j 7^ i). A 
related concept is the so called Delaunay triangulation T, 
defined as the maximal planar subdivision (i.e. a subdivi- 
sion such that no edge connecting two vertexes can be added 
to S without destroying its planarity) whose vertex set is V 
and such that the circumcircle of any triangle in T does not 





Figure 2: An example of a Voronoi diagram 
(left) and the corresponding Delaunay triangulation 
(right) for a set of points in B? 



contain any point of V in its interior. Figure |21 illustrates an 
example of a Voronoi diagram and its corresponding Delau- 
nay triangulation in B? . Note that these definitions can be 
straightforwardly extended to P", with 71 > 2 - all details 
can be found in j^. 

The FV representation |13l considers joint fuzzy sets defined 
from a Voronoi diagram V = {Pi,...,Pp}. There are as 
many rules as Voronoi sites. The fuzzy set Sk is defined 
as by its multivariate membership function fik that takes 
its maximum value 1 at site P^, and decreases linearly to 
reach value at the centers of all neighbor Voronoi sites. 
An example of such a joint fuzzy set is shown in figure |2Ja 
for 71 = 2. 

Formally, the membership value of the input vector / to the 
joint fuzzy set Sk is defined by: 



lc{I) x£Vk 



(6) 



elsewhere, 
where C = P^ is the Voronoi site defining Sk and the Voronoi 




(a) (b) 




(c) 

Figure 3: An example of a (a) joint fuzzy set for 
a single Voronoi region for n = 2, where the mem- 
bership value is represented in the z-axis, and a (b) 
Voronoi diagram (solid line) and its corresponding 
Delaunay triangulation (dotted line) for n — 2. The 
graphic (c) shows an example of the membership 
computation for n = 2. The outer triangle corre- 
sponds to the simplex defined by the Delaunay trian- 
gulation to which X belongs. The membership value 
corresponds to the area of the shadowed triangle. 
Note that the value of the area is 1 when x is equal 
to C and it goes down linearly to on the side of 
the triangle opposite to C 

cell Vk, and lc{i) is the barycentric coordinate of / in the 
simplex Tc{I) of the Delaunay triangulation of V that has 
C as a vertex and to which I belongs. Figure |2jb shows an 
example of the Voronoi diagram and the associated Delau- 
nay triangulation. On Figure^Jc, the barycentric coordinate 
lc{I) corresponds to the (normalized) gray area (volume if 
n > 2) of the sub-simplex formed by I and vertexes of sim- 
plex Tc{I) but C. Note that a very large triangle containing 
all points in the domain is defined in such a way that there 
are no open Voronoi regions in the input domain. 

3. RFV DESIGN WITH EVOLUTIONARY 
ALGORITHMS 

Evolutionary algorithms are selected as the optimization 
tool for RFV controller design, since they have been very 
successful on problems where training data or gradient in- 
formation is very difficult or costly to obtain, like most con- 
trol problems. A fioating point coding scheme is selected, 
where each individual (or chromosome) represents all free 
parameters of the RFV controller as a variable length vec- 
tor of floating point values. An individual / with uj rules is 
defined as the vector: 

Ind ^ Ri : . . . : (7) 

where each sub- vector Ri{l < i < u) is defined as the fioat- 
ing point vector: 

Ri = PI : . . . : P'^'' : alo : . . . : a^;+'^) (8) 

where the P? are the coordinates of the site and al-j are 
the real valued parameters associated to the rule Ri. The 
evolutionary algorithm is described in details in I2UI 1121 . 
The crossover operator is based on geometrical exchange of 
Voronoi sites between both parents with respect to a random 



hyperplane. The mutation operator can either modify the 
parameters of a particular rule by some standard Gaussian 
mutation, or add or delete a Voronoi site, i.e. a rule (see 
sectional- Practical details on the algorithms, including all 
parameters, will be given in section |S] But before experi- 
mentally validating the FV representation, next section will 
discuss some of its properties. 

4. PROPERTIES 

First of all, the RFV representation belongs to the class of 
approximative representations 0, where each fuzzy rule de- 
fines its own fuzzy sets. It also provides continuous output, 
as most fuzzy systems. However, it also has a number of 
useful properties, that we shall now discuss in turn. 

e-completeness property: All RFV-based fuzzy systems 
defined with the RFV representation fulfills the complete- 
ness property at any required level, which establishes that 
any input must belong to at least one fuzzy set with a mem- 
bership value not smaller than a threshold value e: 

Vx€U3A€{Ai,...,A„}fiA{x)^€. (9) 

For the RFV representation, it is clear from the definition of 
the membership function of equation (|SJ that this property 
will hold with e = i, as lc{x) will be above 0.5 if x lies in 
the Voronoi cell defined by C. 

This property guarantees an adequate representation for ev- 
ery input point, since there is always a rule that is applied 
with at least a known value of membership. 

No need for genetic repair algorithms: Since it is not 
possible to define wrong or non complete fuzzy systems, the 
fuzzy systems produced by applying mutation or crossover 
operators are always valid control systems. 

Adaptive fuzzy rules: The influence on the output of 
a particular fuzzy rule in the RFV representation does not 
only depend on the rule itself, it also depends on all neighbor 
rules. The area of application Ak of a fuzzy rule Rk is 
defined as the union of all Delaunay regions which contain 
the point Pk, center of the rule R^. Formally: 

A{Rk)^ U D, Dj e D^{Di,...,D^}. (10) 

where Pk is the center of the rule Rk and D = {Di, . . . , Dry} 
is the Delaunay partition of the set V = {Pi, . . . ,Pp}. Fig- 
ure|31shows an example of the application area of some rules 
in a regular partition, and illustrates the interdependency of 
application areas of neighboring rules when some rules are 
removed or added. 

The evolutionary algorithm evolves individuals that repre- 
sent complete fuzzy systems defined by a set of fuzzy rules 
that are synergistically related, and not fuzzy systems de- 
fined with a set of independent fuzzy rules. The variation op- 
erators hence modify the application areas of all fuzzy rules, 
while still maintaining the required e-completeness level. 



(c) 



(d) 



Figure 4: Diagram (a) shows the application area 
of a fuzzy rule. Diagram (b) shows the application 
area of one of its neighbor rule. Diagram (c) shows 
the application area of the rule of diagram (a) when 
the rule of diagram (b) is removed, and diagram (d) 
the application area of the rule of diagram (a) when 
a rule is added between both rules. 



Adaptive apriori rules: In most fuzzy systems, the user 
can incorporate apriori knowledge by manually defining fuzzy 
sets and the corresponding fuzzy rules. This process implies 
that some restriction on the output values and the partition 
of the input space is introduced in the evolutionary process, 
but the expected benefit is that the evolutionary process, 
biased toward hopefully good parts of its search space, will 
converge faster to better solutions. 

Similarly, the RFV representation allows the definition of 
apriori rules, i.e. fixed Voronoi sites that will not be modi- 
fied by evolution. But one big advantages of the RFV rep- 
resentation is that the expert does not need to specify the 
application area of such rules: thanks to the synergistic ef- 
fect described above, the evolutionary process, by adding 
rules more or less close to the apriori rules will also tune its 
domain of application - as will be clear on the experimental 
results in section]^ 



Recurrent rules: The rules defined in the RFV controller 
are standard TS type fuzzy rules, with their own inputs and 
outputs. The complete system is recurrent because some 
outputs are connected to inputs, but each rule by itself is a 
standard TS type fuzzy rule. This fact contributes to pro- 
vide a clear interpretation of the rules and make easy to 
define the apriori rules for the RFV controller. This ap- 
proach contrasts with other models like RFNN RSON- 
FIN Uni, DFNN [IS] or the TRFN 0, where the rules them- 
selves include backward connections. The recurrent con- 
nection model is similar to the NFSLS approach proposed 
in 1181 . except that only a single output is considered and 
standard fuzzy partition is performed in the input domain. 
Sectionl^will introduce examples that show that this way of 
defining recurrent rules allows easy introduction of apriori 
knowledge. 



Figure 5: The control signal in the system identifi- 
cation problem. 



5. EXPERIMENTS 

In this section, the evolutionary approach to design RFV 
systems is evaluated in two problems. The first one is a 
system identification problem, where the outputs of the sys- 
tem are function of past inputs and outputs. This problem 
is introduced in order to compare the approach with other 
methods. The second problem is more interesting: it is an 
evolutionary robotic problem 1191 . where the ability to in- 
troduce apriori knowledge in the form of recursive rules is 
demonstrated. 

5.1 System identification 

The controlled plant is the same as used in the example 3 
in [51 and is given by: 

where yp{t) and u{t) are respectively the output and the 
input at time t. The desired output is defined by 250 pieces 
of data obtained from: 



yr{t +1) = 0.6yr{t) + 0.2yr{t - 1) + 0.2sm(27rt/25) 
H-0.4sm(7rfc/32) 



(12) 

The test signal used for evaluation is shown in figure |K| 

In the experiments, the population size is set to 50, the prob- 
ability of Voronoi crossover is set to 0.8, Voronoi mutation 
to 0.3, mutation for addition and removal of Voronoi sites 
to 0.1, selection is performed by tournament, elitism is used 
and the number of generations is set to 1200. The fitness is 
defined as the RMS error. The results for the best and aver- 
age RMS error over 50 runs are listed in tabled The table 
shows also the results obtained with the TRFN and RFNN 
models as presented in The comparison has to be con- 
sidered with extreme care since, even if a careful selection 
of parameters was performed to replicate the experiments, 
there are some important differences in the models. For 
example, the RFV model uses variable length individuals, 
while the other methods use fixed length individuals. The 
main implication is that the number of fuzzy rules in the 
RFV model is determined by evolution, while in the other 
models has to be defined in advance. The number of rules 
in the other experiments is set to 4, which also determine 



Table 1: RMS error for the system identification 
experiment with one internal unit 



RFNN+GA 


TRFN+GA 


RFV 


mean 


best 


mean 


best 


mean 


best 


0.3911 


0.0850 


0.0910 


0.0536 


0.0775 


0.0235 



the number of hidden units, since there is a one to one cor- 
respondence between hidden units and fuzzy rules. In the 
RFV model, the number of hidden units is independent and 
it is set to 1. Results in CPU time are not provided (even if 
they are provided in |3) since there is no information on the 
hardware used to run the experiments and the comparisons 
will not be fair. 

5.2 Evolutionary robotics 

A problem defined in the area of evolutionary robotics |19) 
has been selected to validate the RFV model. As a test base 
for experiments, a simulated Khepera robot (171 was used 
for experimentation. A Khepera robot has 8 sensors that 
can be used to measure proximity of objects and ambient 
light levels, and two independent motors to control the speed 
and direction of the robot. The problem consists in drive 
the robot avoiding collisions, starting from a fixed initial 
position, to a target position that depends on light based 
signals that are set to on or off status in the trajectory. The 
presence of an illuminated signal (on status) indicates to the 
robot that it has to turn left in the next intersection, and 
its absence (or off status) that it has to turn right. The 
controller needs internal memory, since the light signal is 
not present in the intersection, but in a previous (and may 
be distant) point in the trajectory. The controller has to 
learn also to forget light signals that affected the behavior 
in previous intersections and have not to be considered in 
other point of the trajectory. 

The fitness of a RFV controller is computed in a similar 
way as in 1191 . evaluating the controller in e different sce- 
narios. Each scenery defines initial and target positions, and 
include path intersections where light signals determine the 
expected trajectory of the robot. The fitness is accumulated 
at every step of the robot proportionally to the speed, in- 
versely proportional to the distance to the target point and 
reduced when the robot travels near obstacles, in order to fa- 
vor navigation without collisions. The fitness accumulation 
is stopped when the robot bumps an obstacle, or it reaches 
a maximum number of steps s. The total fitness is the aver- 
age of the values obtained in the e scenarios. Formally, the 
fitness is defined as follows: 

fitness(/) ^ * ~ * " '^(^^^ (^"^^ 

i = l t = l 

where t is the time step, v{t) is the normalized forward speed 
(summation of the speed of both motors), a{t) is the normal- 
ized maximum activation of the sensors 1191 (for example, 
a{t) — 1 implies a collision) and d{t) is the normalized dis- 
tance to the destination point (for example, d{t) — implies 
that the target has been reached) . This function assigns 
larger values to individuals that travel at the highest speed, 
in a trajectory that follows (when possible) a straight line, 
as far as possible to obstacles and minimizing the distance 
to the target point. 




(b) 

Figure 6: The performance of the best controller 
(a) without apriori knowledge and (b) with apri- 
ori knowledge when evaluated in a scenery not used 
during evolution. 

The controllers are defined with five inputs, two outputs 
and one internal variable. The inputs are, respectively, the 
average of the two left sensors, the two front sensors, the two 
right sensors, the two back sensors and an average of ambient 
light as measured by all sensors. The outputs correspond to 
the speed of the two motors. Note that the presence of 
an internal variable forces the rules to be defined with six 
inputs and three outputs (see figure 0. In the experiments, 
the population size is set to 50, the probability of Voronoi 
crossover is set to 0.8, Voronoi mutation to 0.3, mutation 
for addition and removal of Voronoi sites to 0.1, selection is 
performed by tournament, elitism is used and the number of 
generations is set to 200. The performance of the individuals 
is measured in e = 4 scenarios with three intersections and 
all combinations of light signals, evaluated in at most s = 
500 time steps. 

The experiments were performed also by using apriori knowl- 
edge. The rules defined beforehand and inserted as ex- 
plained in section 3] are shown in table |21 The semantic 
of the rules is defined by considering the internal variable 
j/i as a fiag that indicates if a light signal was seen before. 
The first four rules correspond to the situation where there 



Rule 


site 


"1 


V2 


yi 




L,C,R,B,G,yi 




ao,ax,a2, 03, 04,05,06 


09,01,02,03,04,05,06 


Ri 


0,0,0,0,0,0 


1,0,0,0,0,0,0 


1,0,0,0,0,0,0 


0,0,0,0,0,0,0 


R2 


0,0,0,0,0,1 


1,0,0,0,0,0,0 


1,0,0,0,0,0,0 


1,0,0,0,0,0,0 


R3 


0,0,0,0,1,0 


1,0,0,0,0,0,0 


1,0,0,0,0,0,0 


1,0,0,0,0,0,0 


Ri 


0,0,0,0,1,1 


1,0,0,0,0,0,0 


1,0,0,0,0,0,0 


1,0,0,0,0,0,0 


R5 


0,1,0,0,0,0 


1,0,0,0,0,0,0 


0,0,0,0,0,0,0 


0,0,0,0,0,0,0 


Re 


0,1,0,0,0,1 


0,0,0,0,0,0,0 


1,0,0,0,0,0,0 


0,0,0,0,0,0,0 



Table 2: Apriori rules. The value of the site corresponds to the center of the Voronoi region defined by 
the rule. It is specified by the normalized values of the left (L), center (C), right (R), back (B) and light 
(G) sensors, and the internal variable yi. The values oj correspond to the parameters used to define the 
approximators. 



Table 3: Fitness for the evolutionary robot experi- 
ment 



RFV 


RFV + apriori knowledge 


mean 


best 


var 


mean 


best 


var 


0.7728 


0.8723 


0.0686 


0.8510 


0.8835 


0.0255 



are no obstacles near the robot (all distance sensor values 
are equal to 0). The output produced in all cases for the 
motors is maximum forward speed (note the constant term 
of the approximator is 1 for both motor outputs vi and V2). 
The value of the internal variable j/i is set to 1 when light 
is present (rules 3 and 4) and to the previous value (can 
be or 1) if no light is measured (rules 1 and 2). Rule 5 
produces a turn to the right (left motor at maximum speed) 
if no light was detected before (yi = 0) and rule 6 a turn 
to the left (right motor at maximum speed) if light was de- 
tected (2/1 = 1). In both cases, the flag (internal variable 
yi) is reset to 0. It is important to note that the apriori 
knowledge is defined by specifying rules that determine the 
expected behavior of the controller in specific points in the 
input domain, without specifying the area of application of 
the rules, as it was detailed in section 0] 

The results for the best and average fitness over 10 runs are 
listed in tabled where the best possible value for fitness is 
1 and is the worst. Smaller error is achieved with apriori 
rules, but also the standard deviation is smaller, meaning 
that it is a more robust approach. 

The figure]^ shows the performance of the best controllers 
found during evolution in a scenery not used during evolu- 
tion. The controllers are evaluated for 800 time steps. The 
controller that do not use apriori knowledge can drive the 
robot for a longer distance in the same number of time steps, 
performing not so abrupt turns in the intersections but both 
controUers can drive the robots by following the light signals 
as expected. 

However, the most important point is that a definite seman- 
tic interpretation of the hidden unit is provided with the 
apriori rules: the hidden unit behavior indicates if light was 
or not detected before the intersection. There is no guar- 
antee that a clear semantic is provided with the approach 
without apriori knowledge. Figure shows the value of the 
hidden unit of both best controllers plotted for the 800 time 
steps when evaluated on the test scenery from figure El The 
value of the hidden unit for the controller evolved with apri- 
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Figure 7: Value of the hidden unit of the best con- 
troller obtained through evolution (a) without and 
(b) with apriori knowledge when evaluated on the 
test scenery. 



ori knowledge represent the expected semantics, with two 
peaks on the areas where light signals were detected. No 
clear semantics can be defined in the case of the controller 
evolved without apriori knowledge. 

6. CONCLUSIONS 

In this paper, the RFV model has been proposed. This 
model is an extension of the FV model defined to extend the 
application domain to include temporal problems. The tem- 
poral relations are embedded by including internal units that 
provide feedback by connecting outputs to inputs. These 
internal units act as memory elements. This paper propose 
the use of genetic algorithms as a design tool of the RFV 
model. The controllers are represented by following the FV 
model, which is a representation for fuzzy controllers based 
on Voronoi diagrams that can represent fuzzy systems with 
synergistic rules, fulfilling the e-completeness property and 
providing a simple way to introduce apriori knowledge. The 
geometric interpretation of the rules allows the use of ge- 
ometric genetic operators that proved to be useful also in 
other contexts. The main benefit of the proposed represen- 
tation is the possibility to provide a definite semantic to the 
internal (or recurrent) units. The representation and the 
algorithms have been validated in two problems in the area 
of system identification and evolutionary robotics. Future 
work include experiments on a real mobile robot and the 
study of the impact of using the so-called Symbolic Con- 
trollers approach 
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